2,245 research outputs found

    Feature selection for chemical sensor arrays using mutual information

    Get PDF
    We address the problem of feature selection for classifying a diverse set of chemicals using an array of metal oxide sensors. Our aim is to evaluate a filter approach to feature selection with reference to previous work, which used a wrapper approach on the same data set, and established best features and upper bounds on classification performance. We selected feature sets that exhibit the maximal mutual information with the identity of the chemicals. The selected features closely match those found to perform well in the previous study using a wrapper approach to conduct an exhaustive search of all permitted feature combinations. By comparing the classification performance of support vector machines (using features selected by mutual information) with the performance observed in the previous study, we found that while our approach does not always give the maximum possible classification performance, it always selects features that achieve classification performance approaching the optimum obtained by exhaustive search. We performed further classification using the selected feature set with some common classifiers and found that, for the selected features, Bayesian Networks gave the best performance. Finally, we compared the observed classification performances with the performance of classifiers using randomly selected features. We found that the selected features consistently outperformed randomly selected features for all tested classifiers. The mutual information filter approach is therefore a computationally efficient method for selecting near optimal features for chemical sensor arrays

    High-Dimensional Feature Selection by Feature-Wise Kernelized Lasso

    Full text link
    The goal of supervised feature selection is to find a subset of input features that are responsible for predicting output values. The least absolute shrinkage and selection operator (Lasso) allows computationally efficient feature selection based on linear dependency between input features and output values. In this paper, we consider a feature-wise kernelized Lasso for capturing non-linear input-output dependency. We first show that, with particular choices of kernel functions, non-redundant features with strong statistical dependence on output values can be found in terms of kernel-based independence measures. We then show that the globally optimal solution can be efficiently computed; this makes the approach scalable to high-dimensional problems. The effectiveness of the proposed method is demonstrated through feature selection experiments with thousands of features.Comment: 18 page

    Forward-Backward Asymmetry in Top Quark Production in ppbar Collisions at sqrt{s}=1.96 TeV

    Get PDF
    Reconstructable final state kinematics and charge assignment in the reaction ppbar->ttbar allows tests of discrete strong interaction symmetries at high energy. We define frame dependent forward-backward asymmetries for the outgoing top quark in both the ppbar and ttbar rest frames, correct for experimental distortions, and derive values at the parton-level. Using 1.9/fb of ppbar collisions at sqrt{s}=1.96 TeV recorded with the CDF II detector at the Fermilab Tevatron, we measure forward-backward top quark production asymmetries in the ppbar and ttbar rest frames of A_{FB,pp} = 0.17 +- 0.08 and A_{FB,tt} = 0.24 +- 0.14.Comment: 7 pages, 2 figures, submitted to Phys.Rev.Lett, corrected references and change of tex

    Observation of Exclusive Gamma Gamma Production in p pbar Collisions at sqrt{s}=1.96 TeV

    Full text link
    We have observed exclusive \gamma\gamma production in proton-antiproton collisions at \sqrt{s}=1.96 TeV, using data from 1.11 \pm 0.07 fb^{-1} integrated luminosity taken by the Run II Collider Detector at Fermilab. We selected events with two electromagnetic showers, each with transverse energy E_T > 2.5 GeV and pseudorapidity |\eta| < 1.0, with no other particles detected in -7.4 < \eta < +7.4. The two showers have similar E_T and azimuthal angle separation \Delta\phi \sim \pi; 34 events have two charged particle tracks, consistent with the QED process p \bar{p} to p + e^+e^- + \bar{p} by two-photon exchange, while 43 events have no charged tracks. The number of these events that are exclusive \pi^0\pi^0 is consistent with zero and is < 15 at 95% C.L. The cross section for p\bar{p} to p+\gamma\gamma+\bar{p} with |\eta(\gamma)| < 1.0 and E_T(\gamma) > 2.5$ GeV is 2.48^{+0.40}_{-0.35}(stat)^{+0.40}_{-0.51}(syst) pb.Comment: 7 pages, 4 figure

    Evidence for t\bar{t}\gamma Production and Measurement of \sigma_t\bar{t}\gamma / \sigma_t\bar{t}

    Get PDF
    Using data corresponding to 6.0/fb of ppbar collisions at sqrt(s) = 1.96 TeV collected by the CDF II detector, we present a cross section measurement of top-quark pair production with an additional radiated photon. The events are selected by looking for a lepton, a photon, significant transverse momentum imbalance, large total transverse energy, and three or more jets, with at least one identified as containing a b quark. The ttbar+photon sample requires the photon to have 10 GeV or more of transverse energy, and to be in the central region. Using an event selection optimized for the ttbar+photon candidate sample we measure the production cross section of, and the ratio of cross sections of the two samples. Control samples in the dilepton+photon and lepton+photon+\met, channels are constructed to aid in decay product identification and background measurements. We observe 30 ttbar+photon candidate events compared to the standard model expectation of 26.9 +/- 3.4 events. We measure the ttbar+photon cross section to be 0.18+0.08 pb, and the ratio of the cross section of ttbar+photon to ttbar to be 0.024 +/- 0.009. Assuming no ttbar+photon production, we observe a probability of 0.0015 of the background events alone producing 30 events or more, corresponding to 3.0 standard deviations.Comment: 9 pages, 3 figure

    Combined search for the standard model Higgs boson decaying to a bb pair using the full CDF data set

    Get PDF
    We combine the results of searches for the standard model Higgs boson based on the full CDF Run II data set obtained from sqrt(s) = 1.96 TeV p-pbar collisions at the Fermilab Tevatron corresponding to an integrated luminosity of 9.45/fb. The searches are conducted for Higgs bosons that are produced in association with a W or Z boson, have masses in the range 90-150 GeV/c^2, and decay into bb pairs. An excess of data is present that is inconsistent with the background prediction at the level of 2.5 standard deviations (the most significant local excess is 2.7 standard deviations).Comment: To be published in Phys. Rev. Lett (v2 contains minor updates based on comments from PRL

    Precise measurement of the W-boson mass with the CDF II detector

    Get PDF
    We have measured the W-boson mass MW using data corresponding to 2.2/fb of integrated luminosity collected in proton-antiproton collisions at 1.96 TeV with the CDF II detector at the Fermilab Tevatron collider. Samples consisting of 470126 W->enu candidates and 624708 W->munu candidates yield the measurement MW = 80387 +- 12 (stat) +- 15 (syst) = 80387 +- 19 MeV. This is the most precise measurement of the W-boson mass to date and significantly exceeds the precision of all previous measurements combined

    Observation of the Baryonic Flavor-Changing Neutral Current Decay Lambda_b -> Lambda mu+ mu-

    Get PDF
    We report the first observation of the baryonic flavor-changing neutral current decay Lambda_b -> Lambda mu+ mu- with 24 signal events and a statistical significance of 5.8 Gaussian standard deviations. This measurement uses ppbar collisions data sample corresponding to 6.8fb-1 at sqrt{s}=1.96TeV collected by the CDF II detector at the Tevatron collider. The total and differential branching ratios for Lambda_b -> Lambda mu+ mu- are measured. We find B(Lambda_b -> Lambda mu+ mu-) = [1.73+-0.42(stat)+-0.55(syst)] x 10^{-6}. We also report the first measurement of the differential branching ratio of B_s -> phi mu+ mu- using 49 signal events. In addition, we report branching ratios for B+ -> K+ mu+ mu-, B0 -> K0 mu+ mu-, and B -> K*(892) mu+ mu- decays.Comment: 8 pages, 2 figures, 4 tables. Submitted to Phys. Rev. Let

    Precision Top-Quark Mass Measurements at CDF

    Get PDF
    We present a precision measurement of the top-quark mass using the full sample of Tevatron s=1.96\sqrt{s}=1.96 TeV proton-antiproton collisions collected by the CDF II detector, corresponding to an integrated luminosity of 8.7 fb1fb^{-1}. Using a sample of ttˉt\bar{t} candidate events decaying into the lepton+jets channel, we obtain distributions of the top-quark masses and the invariant mass of two jets from the WW boson decays from data. We then compare these distributions to templates derived from signal and background samples to extract the top-quark mass and the energy scale of the calorimeter jets with {\it in situ} calibration. The likelihood fit of the templates from signal and background events to the data yields the single most-precise measurement of the top-quark mass, \mtop = 172.85 \pm0.71(stat) 0.71 (stat) \pm0.85(syst)GeV/c2. 0.85 (syst) GeV/c^{2}.Comment: submitted to Phys. Rev. Let
    corecore